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Abstract 

We obtain upper bounds for the total variation distance between the distributions 
of two Gibbs point processes in a very general setting. Applications are provided to 
various well-known processes and settings from spatial statistics and statistical physics, 
including the comparison of two Lennard- Jones processes, hard core approximation of 
an area interaction process, and the approximation of lattice processes by a continuous 
Gibbs process. 

Our proof of the main results is based on Stein's method. We construct an explicit 
coupling between two spatial birth-death processes to obtain Stein factors, and employ 
the Georgii-Nguyen-Zessin equation for the total bound. 

Keywords: Conditional intensity, pairwise interaction process, birth-death process, 
Stein's method, total variation distance. 
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1 Introduction 

Gibbs processes form one of the most important classes o f point processes in spatial statis 



tics that may incorporate dependence between the points (jMetller and Waagepetersenl . 12004 



Chapter 6). They are furthermore, mainly in the special guise of pairwise i nteraction pro 



cesses, one of the building blocks of modern statistical physics (iRuelld . If969l ) . 

Up to the somewhat technical condition of hereditarity, see Section 2, a Gibbs process on a 
compact metric space X is simply a point process whose distribution is absolutely continuous 
with respect to a "standard" Poisson process distribution. It is thus a natural counterpart 
in the point process world to a real-valued random variable that has a density with respect 
to some natural reference measure. A notorious difficulty with Gibbs processes is that in 
most cases of interest their densities can only be specified up to normalizing constants, which 
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typically renders explicit calculations, e.g. of the total variation distance between two such 
processes, difficult. 

In the current paper we give for the first time a comprehensive theorem about upper 
bounds on the total variation distance between Gibbs process distributions in a very general 
setting. These bounds provide natural rates of convergence in many situations, and include 
explicit constants, which are small if one of the Gibbs processes is not too far away from a 
Poisson process. 

For the important special case of bounding the distance between two pairwise interaction 
processes Hi and H 2 on X C R D with densities proportional to 0^ Yl{ x y }cs, ~ v) an d 
x >y }c£ ^ 2 ( X ~~ y)> respectively, where </?i and <f 2 are pairwise interaction functions that 
are bounded by one (inhibitory case), a consequence of our results is that there is an explicitly 
computable constant C = C(j3,cp2) > such that 



d TV (^(E 1 ),^'(E 2 ))<C\\ ( p 1 - V2 \\ L i. 



'1.1: 



If we relax the condition that the pairwise interaction functions are bounded by one and 
require suitable stability conditions for Si and 2 2 instead, we still obtain 



d TV (^(E 1 ),Jf{E 2 )) < C(e)\\<px - p 2 |Ui +e, 



1.21 



where e can be chosen arbitrarily small, causing a bigger C(e). We give more explicit 
examples for Strauss, bi-scale Strauss, and Lennard- Jones type processes in Sections [3] and HI 

For our proof of the main results we develop Stein's m ethod for Gibbs process approxi- 
mation. Using the generator approach by IBarbourl (119881 ). we re-express the total variation 
distance in terms of the infinitesimal generator of a spatial birth-death process (SBDP) whose 
stationary distribution is one of the Gibbs process distributions involved. An upper bound 
is then obtained by constructing an explicit coupling of such SBDPs in order to obtain the 
so-called Stein factor and applying the Georgii-Nguyen-Zessin equation. 

Previou sly Stein's method has been applied very successf ully for Poisson process approxi- 



mation (see 



Barbour and Brownl . Il992l . I Chen and Xial . 120041 and ISchuhmacherl . 120091 ) . Other 



notable developments in the domain o f point process approximatio n concentrate on Com- 
pound Poisson process approximation ( IB arbour and Manssonl . 120021 ) and on approximation 
by certain point processes whose points are i.i .d. given their total n umber (called polyno- 



mial birth-death proceses by the authors, see Kia and Zhangi . 120121 ). In the latter article 
the authors give substantially improved bounds when replacing approximating Poisson or 
Compound Poisson processes by their new processes. However, these new processes are by 
no means flexible enough to approximate processes typically encountered in spatial statistics, 
where truely local point interactions take place, such as mutual inhibition up to a certain 
(non- negligible) distance- 
In iBarbour and Chen! ( 120051 ) the editors write in the preface: "Point process approxima- 
tion, other than in the Poisson context, is largely unexplored". This statement still remains 
mostly true today, and the present paper makes a substantial contribution in order to change 
this. 

Apart from approaches by Stein's method the authors are not aware of any publications 
that give bounds for a probability metric between Gibbs processes in any generality. There 
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is however related work by iMollerl (119891 ) and iDai Pra and Postal (120121 ) , where convergence 
rates for distances between an SBDP and its stationary point process distribution were 
considered. 

The plan of the paper is as follows. We start out in Section [5] by giving the necessary 
definitions and notation, including a somewhat longer introduction to Gibbs and pairwise 
interaction processes. Section [3] contains the main results. While Subsection 13.11 treats the 
common case where the approximating Gibbs process satisfies a stronger stability condition, 
Subsections 13.21 and 13.31 lay out a strategy and give concrete results under very general 
conditions. Simpler examples are scattered throughout Section [31 while Section 0] looks 
at the three more involved applications mentioned in the abstract. In Section Owe discuss 
spatial birth-death processes and present the coupling needed for obtaining the Stein factors, 
and in Section [6] we develop Stein's method for Gibbs process approximation and give the 
proofs of the main results. The paper finishes by an appendix that justifies the reduction of 
our main proofs to a state space with diffuse reference measure a. 



2 Prerequisites 

Let (X, d) be a compact metric space, which serves as the state space for all our point 
processes. We equip X with its Borel a-algebra B = B(X). Let a ^ be a fixed finite 
reference measure on (X, B). If X has a suitable group structure, a is typically chosen to be 
the Haar measure. If X C M D , we tacitly use Lebesgue measure and write \A\ := Leb D (A). 
Also, unless specified otherwise, we assume d to be the Euclidean metric in this case and 
write o>d = n D / 2 jY{D /2 + 1) for the volume of the unit ball. 

Denote by (*Tt,jV) the space of finite c ounting measures ( "point configurations") on X 
equipped with its canonical cr-algebra (see iKallenbergl . Il98fil . Section 1.1). For any £ G 



write |£| = £(X) for its total number of points. A point process is simply a random element 
of 01. 

For a finite measure A on X recall that a point process H is called a Poisson process with 
expectation measure A if the point counts H(Aj), 1 < i < n, are independent Po(A(v4j))- 
distributed random variables for any pairwise disjoint sets Ai,...,A n G B. It is a well 
known fact that such a Poisson process may be constructed as H = ^2%=i > where N is a 
Po(A(A'))-distributed random variable, and Xi are i.i.d. random elements of X with distri- 
bution X(-)/X(X) that are independent of N. We denote the Poisson process distribution 
with expectation measure a by Poi. We will make extensive use of the fact that for H ~ Poi 
and any measurable function h : — > M + we have 



Eh(l 



^ k=0 ' "PC 



where we interpret the summand for k = as h($), writing for the empty point configura- 
tion. Equation (12. ip is obtained by conditioning on the total number of points of H. 
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2.1 Gibbs processes 

We use the definition of a Gibbs point process from spatial statistics. Call a function u : — > 
R+ hereditary if for any £, 77 G 9t with £ < 77, we have that = implies tt(r/) = 0. 

Definition 2.1. A point process 5 on X is called a Gibbs process if it has a hereditary 
density u with respect to P01. 

Gibbs processes form important models in spatial statistics. The specification of a density 
allows us to model point interactions in a simple and intuitive way. Under rather flexible 
conditions, e.g. log-lineari ty in the parameters, forma l infer ence (partly based on numerical 



methods) is possible. See iMoller and Waagepetersenl (120041 ). Chapter 9. 



It will be convenient to identify a Gibbs process by its conditional intensity. 
Definition 2.2. Let 5 be a Gibbs process with density u. We call the function A(- | ■) : X x 

the conditional intensity (function) of E. For this definition we use the convention that 
0/0 = 0. 

Note that other definitions of the conditional intensity in the literature may differ at 
pairs (x,£) with x G £. It is well-known and with the help of Equation (12. ip straightforward 
to check that the conditional intensity is the ex (g) S£ (S)-almost everywhere unique product 
measurable function that satisfies the Georgii-Nguyen-Zessin equation 

h{x,E-5 x ) E{dx)J = j E(h{x,E)\{x\E)) a{dx) (2.3) 

for every measurable h : X x 9T — > M+ . 

A Gibbs process is usually specified via an unnormalized density u that is shown to 
be Poi-integrable. Typically the integral and hence the normalized density u cannot be 
computed explicitly. On the other hand the conditional intensity can be calculated simply 

as 

A(*U) = 5<i±« (2 .4) 

and has a nice intuitive interpretation as the infinitesimal probability that E produces a (fur- 
ther) point around x given it produces at least the point configuration £. Also it determines 
the Gibbs process distribution completely, since an unnormalized density u can be recovered 
recursively for increasingly large point configurations by employing ( 12. 4p . We denote by 
Gibbs(A) the distribution of the Gibbs process with conditional intensity A = A(- 1 ■). 

The measure A given by \(A) = E(H(A)) for any A G B is called the expectation measure 
of E, provided that it is finite. By Equation ( 12.31) we have 

A(A)=E^y l{xeA} E(dx^j = J l{x G A}E(\(x \ E)) ct(dx), 
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i.e. A is absolutely continuous with respect to a. We call its density X(x) = E(A(x | 5)) 
the intensity of 5. We use notation A(-) and A(- | ■) to distinguish the intensity and the 
conditional intensity if necessary. 

In the main part of the paper we distinguish between the approximated Gibbs process 
H with a general conditional intensity u, and the approximating Gibbs process H, whose 
conditional intensity A will typically (except in Sections 13. 2H4.ip satisfy the stability condition 

sup / X(x | £) a(dx) < oo. (S) 



Note that this condition follows from the local stability condition 



A(x|0 < ip*(x) 



for an integrable function ip*: X — \ M.+. Local stability is satisfied for many point proces s 
distributions traditionally used in spatial statistics. See iMoller and Waagepetersenl ( 12004 ). 
p. 84 ff. 



2.2 Pairwise interaction processes 

A special type of Gibbs processes that are noteworthy both for their relative simplicity and 
their abundant use in statistical physics are the pairwise interaction processes. We treat 
distances between such processes in detail in Sections [3] and HJ 

Definition 2.3. A Gibbs process H on X is called a pairwise interaction process (PIP) if 
there exist /3 : X — > M. + and symmetric <p : X x X — > IR + such that H has the unnormalized 
density 

= 11 P( Xi ) 1 1 vi.Xi.Xj) 

l<i<n l<i<j<n 

for any £ = Y^=i ^ The normalizing constant is usually not analytically computable. 
We then denote the distribution of H by PIP(/3, ip). The PIP is called inhibitory if ip < 1. It 
is called /lord core with radius 5 > if tp(x,y) = whenever d(x,y) < 5. 

The conditional intensity of H ~ PIP(/3, ip) is accordingly given by 

n 

K x \Cl = P(x)Y[tp(x,Xi). 

1=1 

For u to be integrable with respect to Poi, it is by Equation (12. ip necessary that (3 is 
integrable. For inhibitory PIPs this is obviously also sufficient. The same holds for hard core 
PIPs with bounded (p, because by the compactness of X their total number of points is almost 
surely bounded. For more general PIPs the situation is not so simple (see Example 13.191 for 
a special case). We will then assume the following conditions. 

(RS) Ruelle stability. There exist a constant c* and an integrable function ip* such that 
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(UB) Upper boundedness. There exists a constant C such that tp(x,y) < C for all 
x, y G X. 

(RC) Repulsion condition. There exist 5 > and < 7 < 1 such that for all x, y £ X 
with d(x, y) < 5 we have <p(x, y) < 7. 

Note that (RS) is the f orm o f Ruelle stability commonly used in spatial statistics, see 



Moller and Waagepetersenl (120041) . I f we c an choose ip*(x) as f3(x) times a constant, we 



get the classical definition by iRuelld (119691 ) . In any case Ruelle stability ensures that the 
unnormalized density u is integrable. 

If we can write the interaction function as tp(x,y) = e~ v( - x ' y \ then (UB) is equivalent to 
requiring that the potent ial V is bounded f rom b elow, which is a commonly used condition 
in statistical physics, see iKondratiev et al. d2012j) . 



Furthermore we introduce notation for the inner and outer ranges of attractive interac- 
tion. 

(IR) Interaction ranges. Let 5 < r < R be constants such that for all x, y G X with 

d(x,y) < r or d(x,y) > R we have ip(x,y) < 1. 

Note that such constants always exist due to (RC) and the compactness of X. 

Strictly speaking only inhibitory PIPs satisfy (S). However for our purpose it is actually 
enough to require finiteness of both the Jzf (S)-esssup and the Jzf (H)-esssup instead of the 
supremum in (S). This would also admit comparisons of arbitrary hard core PIPs. However, 
for the ease of presentation we deal with hard core PIPs together with the more general PIPs 
in Subsections 13.31 and 14.11 

2.3 Reduction to a diffuse reference measure cx 

In the remainder of this paper we will tacitly assume that the reference measure at is diffuse, 
i.e. satisfies ct({x}) = for any x e X. This implies that the Poi-process and the corre- 
sponding Gibbs processes are simple, i.e. with probability one do not have multiple points at 
a single location in space. It is then convenient to interpret a point process as a random finite 
set and use set notation, which is commonly done is spatial statistics. Thus we may write 
for example £ C r\ instead of £ < 77, or in the density of a PIP fi( x ) Y\{ x y }c£ < ^( x ' v) 

instead of Yli<i< n P( x i) Hi<i<j<n <P( x i> x i)- 

In addition to simplifying the notation considerably by making points identifiable by their 
location in space, assuming a diffuse ex. also reduces the differences in various definitions of 
the conditional intensity X(x | £) for the G ( to an a ® Poi-null set. 

We show in the appendix that our results in Section [3] carry over to the non-diffuse 
case. Essentially this is seen by extending the state space X to X x [0, 1] and considering 
a Cg> Leb|[n,i] a s a new (always diffuse) reference measure. This is based on an idea used in 



Chen and Xial (12004 ) . 
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3 Total variation bounds between Gibbs process dis- 
tributions 



3.1 Main results 

In this section we state our main results on bounds for the total variation distance between 
Gibbs process distributions. Define Ttv as the set of all measurable functions /: 5t— >■ [0,1]. 
Then for two point processes S and H, the total variation distance is defined as 

d w (^(2),JSf(H))= sup |E/(S)-E/(H)|. (3.1) 

feTrv 

By a simple approximation argument this is equivalent to 

d T v(j2f(S),Jgf(H)) = sup |P(S G A) — P(H G (3.2) 

The principal idea behind our proofs is a suitable variant of Stein's method, which we 
develop in Section The proofs of all the results in the present subsection are deferred to 
Section [6] as well. 

The gist of Stein's method for Gibbs(A)-process approximation may be summarized as 
follows. For / G Ttv write 

f(0-Ef(R)=Ah f (0, (3.3) 

where A is the infinitesimal generator of a spatial birth-death process and hj\ — > R is a 
suitable function in its domain. In general (13.31) is called a Stein equation. It turns out that 
we are able to estimate |E.A/i/(H)| uniformly in / G Ttv if we have an appropriate upper 
bound for the so-called Stein factor cj.(A), which is a term based on certain differences of 
the functions hf. 

Finding such a bound for ci(A) is usually the main challenge when applying Stein's 
method. To achieve this, we construct a coupling between two spatial birth-death processes, 
which is presented in Section [5j The explicit bound is given in the next proposition. Denote 
by ||-|| the total variation norm for signed measures on X. Thus ||£ — 77 1 1 for £, 77 G 9t is the 
total number of points appearing in one of the point configurations, but not in the other. 

Proposition 3.1. For any n* G N U {00} we have 

cr(A) < K - 1)! (-) - £ ^ + / - £ ^ + — £ (3.4) 

\ i=n* J{) i=n* / i=l 

where 

e— sup / \X(x I ^) — X(x 1 77)1 a(dx) < 00 and 
\\t-v\\=iJx 

c = c(n*) = sup / \A(x\ £) — \(x\r))\ a(dx) < 00. 

||e-?7||>n* Jx 

If n* = 00, we interpret the long first summand in the upper bound as 0. For e = and/or 
c = the upper bound is to be understood in the limit sense. 
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Remark 3.2. We often have a bound for c which does not depend on n* . In this case we 
choose n* = \c/e], which turns out to be optimal. 



Remark 3.3. If e < 1, we can choose n* = oo and obtain 



ci(A)<i±^log 



l-e 



< 



l + e 



Conditions of the type e < 1 are known in the statistical physics literature a s "low activity, 
high temperature" setting, see for example iKondratiev and Lytvynovl (120051 ). 



Remark 3.4. If H is a Poisson process, then X(x |£) = X(x) does not depend on £. We 
then have e = 0, and obtain Ci (A) = 1. Hence Proposition 13.11 and Remark 13.31 contain 
Lemma 2.2(i) in iBarbour and Brown! (119921 ) as a special case. 



Now we can state our main theorem. 



Theorem 3.5. Let 
satisfies (S). Then 



Gibbs(z/) and H ~ Gibbs(A) be Gibbs processes. Suppose that H 



rf TV (^f(H),j5f(H)) < Cl (A) / E\u(x\E) - \{x\E)\ a{dx) 

Jx 

where ci(A) is bounded in Proposition ^. 1\ 



(3.5) 



Remark 3.6. The assumption that S is a Gibbs process is used only in the proof of The- 
orem 13.51 for invoking the Georgii-Nguyen-Zessin equation, and is of course not needed 
for bounding Ci(A). The Georgii-Nguyen-Zessin equation may be generalized by replacing 
v[x | £) a.[dx) with the Papangelou kernel u(dx\^) of H if the so-called Conditi on (£) is 
satisfied , i.e. if F(p(A) = | S l ^c) > a.s. for every A E B. See iKallenbergi (119861 ). Section 
13.2, or lDaley and Vere-Joned (120081 ). Section 15.6, for details. 

We may therefore generalize Theorem 13.51 as follows. Let 5 be a point process that satisfies 
Condition (£) and has Papangelou kernel u, and let H ~ Gibbs(A) satisfy Condition (S). 
Then 

d w (^(H),JS?(H)) < ci(A)E||i/(dx|E)-A(a;|E)a(cfo)||, 
where Ci(A) is as above and ||-|| is the total variation norm between signed measures on X. 

For inhibitory PIPs we obtain the following theorem, which relates the total variation 
distance to the L 1 -distance between the interaction functions. 

Theorem 3.7. Suppose that H ~ PIP(/3, (fi) and H ~ PIP(/3, (p 2 ) are inhibitory. Let z/(y) = 
E(i/(|/ 1 S)) denote the intensity ofE. Then 



tW(^(S),JS?(H))< Cl (A) / / P(x)iy(y)\^ 1 (x,y)-< f2 (x,y)\cx(dx)a(dy), 

Jx Jx 

where Ci(A) is bounded in Proposition \3. 1\ with 



(3.6) 



c< I f3(x) a(dx) and e = sup / /3(x)(l — (p2(x, y)) ct(dx). 
x yex J x 



In the case where X C IR D ; /3 is constant and (fi(x,y) = (pi(x — y) depends only on the 
difference, we obtain 



(3.7) 



d w (jgf(S),Jgf(H))< Cl (A)/9E(|S|) / \ Vl {x)-ip 2 {x)\dx. 



Note that Inequality ( 13. 7p implies Inequality ( II. ip in the introduction, since E(|5|) = 
fE(u(y\S))dy<P\X\. 

If one of the processes is a Poisson process, we obtain both a s light improvement and a 
very substantial generalization of the bounds in lBrown and Greigl (I1994I ). 



Example 3.8. Let H be a Poisson process with intensity function {3, and let 5 ~ PIP(/3, (p) 
be inhibitory, denoting its intensity function by v. By Theorem 13.71 and Remark 13.41 we 
obtain 

<W(JS?(S),JS?(H))< / / ^)Ky)(l-^,y))aWa(iff). (3.8) 

J x J x 

The special case where X = [0, 1} D with torus convention, a is Lebesgue measure, and 
H is a station ary hard core proce s s with cons tant /? and ip(x,y ) = l {\\x — y\\ > r} was 
considered in Barbour and Brownl ( 1992 ) and Brown and Greigl ( 1994 ). except that these 
articles approximate by a Poisson process H that has the same intensity v as S. We obtain 
from (EU) that 

drv(jgf(S), JSf (H)) < /3i/a D r D < /3 W D . (3.9) 

The best bound in Brown and Greigl ( 19941 ). namely Inequality (12), says that under a 
somewhat complicated additional condition on the parameters we have 



dav^E), JSf(H)) < ( 1 + ^ Jz/W D 



(3.10) 



By a straightforward upper bound on /3 (see iBrown and Greigj . I1994J . Inequality (11)) our 
result ( 13 .9p may be bounded further to obtain 



d TV (J?(E),J?(K))< 



1 



1 — voir,r 



-^v 2 a D r D 



for r < l/^z/a^) 1 /- , which holds without the additional condition and is an asymptotic 
improvement over ( I3.10P by a factor of 2 D /(2 D + 1) as r — > 0. 

This suggests that for small r it is better to approximate 5 by H than by H, both because we 
get a smaller bound and because the intensity of H is known explicitly from the parameters 
ofS. 



Example 3.9. A PIP is called a Strauss process, if its interaction function is given by 

(p(x,y) = 



if d(x, y) < R, 
if d(x, y) > R, 
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for some constants < 7 < 1 and R > 0. Let 5 and H be Strauss processes with constant f3 
and further parameters ji,Ri and 72,-^2, respectively, where Ri > R 2 - Denote by M(y,R) 
the closed ball in X with centre at y and radius R. Then by Theorem 13.71 

drv(JS?(S),J&?(H)) < Cl (A)E(|H|)/3sup f(l- 7l )a(B(y,i2 1 )\B(j/,i2 2 ))+|7 1 -7 a |a(B(y,i2 2 ))), 

(3.11) 

where C\[X) is bounded in Proposition 13.11 with 

e = ^(1-72) sup a(B(y,i?2)) and c<fia(X). 



3.2 Processes violating the stability condition (S) 

Many Gibbs processes satisfy Condition (S), but there are some important exceptions. In 
the present subsection we provide a technique for treating these exceptions. In Subsection [33] 
we apply this technique to general PIPs. 

We call an event A hereditary if the corresponding indicator function is hereditary, i.e. if 
7] G A implies (eA for all sub configurations £ C rj. 

Let A be a hereditary event such that P(H G A) > 0. Let H A ~ ££ (H | H G A). For 
instance, if A = {rj G 91: < M} for some M G N; then H A has the same distribution as H 
conditioned on not having more than M points. In many cases Ha then satisfies (S), even 
if the original process H does not. 

The following two lemmas are needed for reducing the problem of approximating by the 
process H to a problem of approximating by Ha- 

Lemma 3.10. The process Ha has hereditary density ua{0 = £ A}/P(H G A) with 

respect to P01 and conditional intensity Xa(x | £) = A (x | + 5 X G A}, where u and A 

denote the density and conditional intensity of H, respectively. 

Proof. Note that for all measurable / : — > M. + 

mn, - E «/,H) , - «p - 1 /( 0H|M|^1 Vom . 

Furthermore, by the definition of the conditional intensity, Equation (12.21) . 

where the last equality follows by the hereditarity of A. □ 
Proposition 3.11. Let A be a hereditary event, and let Ha ~ j£?(H | A). Then, 

d TV (Jtf(~), jgf (H)) < dw(^(S), JSf (H A )) + P(H $ A). (3.12) 
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Proof. Note that 



d w (^(H A ), JSf(H)) = sup |P(H G 5 | H G A) - P(H G S) 

-Be A/" 



sup 



sup 

BeAf 



P(H eB,HGA) 



P(H g A) 



— P(H 6 B, H 6 A) - P(H £5,H^A) 



;i-P(HGA))^A^-P(HGB,H^) 



/ , , N P(H G 5, H G A) 
<max sup P H i A) 1 ' ; , sup PH G5,H^) 

V-BeA^ P(H G A) SeA r 

=P(H i A), 
and the triangle inequality yields the claim. 
Corollary 3.12. For hereditary events A and A' we get 

d w (^(3),J&?(H)) < d TV {J?(E A ),J?(K A ,)) +P(H £ A) +P(H £ A'). 



□ 



3.3 General pairwise interaction processes (PIP) 

For PIPs the following hereditary event is very useful. Let k G N, 5 > 0, and 

A k = {£em: supf(B(y,5/2))<fc}, 



(3.13) 



i.e. we require that the PIP has at most k points inside any closed ball with radius 5/2. If 
k = 1 this is equivalent to the event that the PIP has a hard core radius 5. 

Lemma 3.13. Suppose that H ~ PIP(/3, ip) satisfies the conditions (RS), (UB) ; (RC) and 
(IR) with the constants C , 5, 7, r and R. Then 



P(H £ A fc ) = P(3y G # : H(B(y, 5/2)) > jfe + l) < 
Proof. Note that by Equation fl2.ll) 



fc( fc + 1 ) oA . 
7 2 B k s 

(k + l)\C k 



E(|H|C fc l H l), (3.14) 



P(3yG^:H(l(y,5/2))>fc + l) = e- Q W jr ± f . . . f u ({x 1} . . . , x n }) 

n=k+l ' ^ X " X 



l{3y G Af, 3{ii, . . . , z fc+ i} C {1, . . . , n} : x h 



The indicator in (" 13 . 1 5[) is less than or equal to 



, ■ ■ ■ j %i k+1 G 



#/2)} cx(dxi) ■ ■ ■ at(dx n ). 

(3.15) 



l{EyeX,x h ,...,x ih+i eM{y,5/2)}. 

{«!,.••• tfc+l}C{l,...,7l} 



(3.16) 
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For any permutation (ii, . . . , i n ) of (1, ... , n) the density can be rewritten as 

k \ / \ k n 



U 



n^(^i))( n v(xi J ,x il )\u({x ik+1J ... J x in })Y[Y[ip(x ij ,x il ). 

j=l ' V<j<Kfc+l ' j=l l=k+2 



Thus by (RC) and (UB) we obtain 
u({xx, . . .,x n }) l{3y G X, x h , . . -,x ik+1 G M(y,5/2)} 



k 

< 



(fl ft x *i)) 7 (fcJ1) «(K +1 > • • • , x t J)C k( ~ M H{3y eX, Xil ,..., x tk+1 G B(y, 5/2)} 

where the last line follows by the triangle inequality. Thus in total from Equation (13. 15ft 
P(H i A k ) 

e h e /•••// •-/ (n^))>< 

n=k+l ^ fa^QciWCi^ ^Hij jB K + i'f j=l ' 

fi-fc fe 

1 V^(n-fe-l), /r/ 



7 ( 2 )c*(« fc ^({i^, . . . ,x in }) at(dx h ) ■ ■ ■ a.{dx ik ) cx(dx ik+1 ) ■ ■ ■ cx(dx in ) 

fc ( fc + 1 ) oo , r r 

^ 12^rk e ' a{X) E 7 !L: rw Cf * (n "* ) / ■■■ / «({^ + i,...^n})a(^ fe+1 )---a(^ n ) 
{k+l)\C k „±^An-ky. J x J x 



7 2 ' B " : E(\R\c k ^) 



n—k 



fc(fc+i) 

(& + l)!C feJ 

by Equation ([XT]) . □ 



Lemma 3.14. Consider the process H^ fc , whereH ~ PIP(/3, <p) satisfies the conditions (RS) ; 
(UB) ; (RC) and (IR) the constants C , 5, 7, r and i?. Define M k = C mk < 00 roi/i 

m k = sup £(A(x,r,.R)), 

where A(x, r, i?) = B(x, R) \ M(x, r) . Then 

X Ak (x I = \(x I 01{C + <** e M < /3(x)M fc . 

77ms means that the new process H^ fe is locally stable and hence satisfies condition (S). 
In the Euclidean setting m k < mk, where 



m = a D D D / 2 ^ 



R \D (T \ D 
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Proof. By Lemma [3]T0] and Conditions (UB) and (IR) we see that \ Ak {x | £) can be bounded 
by 0(x)C^ A ( x ' r < R »l{£ + 5 x e A k } < (3(x)C mk , and m,k is finite, since it can be bounded by 
k times the minimal number of balls with radius 5/2 needed to cover A(x,r,R). 

In the Euclidean case, consider a partition {Qi\f =1 of X by cubes of edge length 8/yD. 
Since the diameter of each cube is 5, one can cover each cube by a ball with radius 5/2. 
Furthermore a cube can intersect A(x, r, R) if and only if it is contained in A(x, r — 5, R + 5). 
Thus the number of cubes intersecting A(x, r, R) can be bounded by the volume of A(x, r — 
5,R + 5) divided by the volume of a cube, i.e 



\A(x,r-5,R + 5)\ , nD/2 / (R , A D fr \»\ 
Thus rrik < mk. □ 



The next theorem is a generalization of Theorem 13.71 that includes non-inhibitory PIPs. 

Theorem 3.15. Assume that E ~ PIP(/3i,<^i) and H ~ PIP(/3 2 , y^)- Furthermore assume 
that they satisfy the conditions (RS) ; (UB), (RC) and (IR) with the same constants C , 
5, 7, r and R. Let v Ak {y) = ~K{y Ak {y 1 H^ fe )) denote the intensity o/H^. Then we have for 
any k G N 

tM^(S),JS?(H))< Cl (A)M fc / / ${x)v Ak (y)\ <Pl (x,y)-(p 2 (x,y)\a(dx)a(dy) 

J X J X 

k(k + l) . 

+ (I+\)!^ E(|S|Cfc ' 5 ' + |H|Cfc ' H|) ' (3 - 17) 



where M k is defined in Lemma \3. IJ\ and Ci(A) zs gwen m Proposition ^. 1\ with 



c < Mk / a (<ix) and 



x 



£<M fc sup( / (3(x)\(p 2 (x, y) — 1| a.{dx) + / /3(x) ct(dx) 

Note that k G N can be chosen such that the two terms in (13.171) are best balanced. 
There is also some freedom in the choice of 5 in (IR). In particular it is always possible to 
choose a lower 5 at no cost for 7, i.e. the last term in (13. 17|) can be made arbitrarily small 
by letting 5 — > 0, which on the other hand leads to an explosion of M/,. 

Remark 3.16. Usually the expectations in (I3.17P are not easy to compute, but at least 
Ruelle stability guarantees their finiteness. Let u be the density of H, let ip* be as in (RS), 
and write oc(i/j*) = f x ip*(x) ot(dx). Then for a constant c** > 

E(\E\C k ^) < c**e- a ^ Yl , = c**C k cx(r)e Cka{r) - a{X) < 00. 

n=0 n ' 

Remark 3.17. If one is interested in a very specific Gibbs processes model, then the esti- 
mates in Lemma 13.131 and Lemma 13.141 may be improved. See for instance Subsection 14. 1[ 
where we treat Lennard- Jones type processes. 
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By a slight adaptation in the proof of Theorem I3.7[ we can get a nicer result for hard 
core PIPs. 

Theorem 3.18. Let S ~ PIP(/3 1 ,y9 1 ) and H ~ PIP(/3 2 , ^2) ■ Assume that both processes 
have a hard core radius of 5 > 0, and satisfy (UB) and (IR) . Then 



<h v (Sf(E),Jf(R))<c 1 (X)M 1 / /3(x)u(y)\ t p 1 (x,y)- l p 2 (x,y)\a(dx)a(dy). (3.18) 

Jx J x 

Proof of Theorem \3.15[ We adapt the proof of Theorem 13.71 to compute a bound for 
dTvi^i^Ak), -^(HajJ)- Let £ = J^" =1 Since both and (p 2 are bounded by the same C, 

Y[<Pi(x,yi) \ ( f[ <p2(x,yi)\l{Z + 6 x eA k } 

< C ,?(B(x,i?)\B( a; ,r)) 1 |^ + ^ e < ^ = Mfc _ 

Thus 

d TV (^(H A J,J5f(H A J) 

/■ /■ (3.19) 

< ci(A)M fc / / 0(x)i/A*(y)|¥>iOp,2/) -¥>2(a:,l/)| a(tir) a(dy). 



Since by Lemma [3. 141 |AA fc (s | £) — A^ fc (x | t/)| < (3(x)M k for all a; G A' and £,77 G 91, 
we have c < f x /3(x) ac(dx). By Remark 16.21 the supremum in the formula for e can be 
replaced by an essential supremum with respect to J?f (S^ fe ) + Jzf (H^J. This implies that it is 
enough to take the supremum only over £, r] G Note that if d(x, y) > 5, £ + 5 X G v4 fc and 
£ + S y G Afc, then also £ + 5 X + 5 y E A k . Therefore, by using X(x | £ + 5 a ) = A(x | £)(^ 2 (a;, y), 
we obtain 



£ = sup / \\A k (x I 77) - \A k (x I £)| a(rfx) 
S^eAfe, ||£-»?||=i Jx 

sup / |A(ar | £ + <y 1{£ + 8 X + 8 y E A k } - X(x | £)!{£ + 5 Z G a(tfe) 
< sup (/ \(x\€)\<p 2 (x,y)-l\l{Z + 6 ai + 6 v eA k }a(dx) 

yeX,£+6 y eA k \Jx\M(y,8) 

+ / |A(x \£ + 6 y )l{Z + 6 x + 6 v E A k }-X(x | £)!{£ + 4 E A k }\ a(dx)) 

Jn(y,6) J 



<M fc sup( / /3(x)\(p 2 (x, y) — 1| a(dx) + / /3(x) a(dx) J . 

The claim now follows by applying Corollary 13 . 121 with A = A' = A k and Lemma [3]T3j □ 

Proof of TheoremWJR Note that Jgf(S) = -5f(H Al ) and JS?(H) = if(H Al ). The statement 
follows now from Inequality (13.19!) . □ 



14 



Example 3.19. Let X C IR D . A PIP is called a multi-scale Strauss process, if its interaction 
function (p(x, y) depends only on ||a; — y\\, is piecewise constant and takes only finitely many 
values. We restrict ourselves to bi-scale Strauss processes, i.e. the interaction function is 
given by 

7, if ||x — y\\ < r, 
if(x, y) = {C if r < \\x — y\\ < R, 
1 if \\x — y\\ > R, 

for some constants 0<7<1,C>0 and < r < R. To ensure (RS), we furthermore 
require that C < 7 -1 /( 2m ) with 

m = m(r, R; D) = a D D D/2 (- + l) °. 
Note that m is t h e sam e as in Lemma [3. 141 with 5 = r. (RS) then follows by a criterion 



of iKondratiev et al.l (120121 ). Subsection 2.3. The authors use m as a bound on the maximal 
number of cubes with edge length 8/yD that intersect the annulus A(0, r — 5, R + 5), as we 
did in Lemma 13.141 

To illustrate Theorem 13.151 let H and H be bi-scale Strauss processes with constant 0, 
the same 7, r, R, and with 1 < Ch < Co. The ingredients for computing ci(A) are 

£ = a D (3C™ k (r D + (C n -l)(R D - r D )) and c = (3C™ k \X\. 
since 7 < C s 2m , Theorem [315] yields 

drv(J2f(S), Jgf(H)) < c 1 (A)a D /3Cg lfc E(|H A J)(C s - C H )(i? D - r D ) 

Qpp r ^- mfc 2_ (m+1)A .^ ijj|C'*I h ^ 
(/c+l)! H V s / 

4 Applications 

4.1 Lennard— Jones type processes 

In this subsection let (X, a) be a compact su bset of R D eq uipped with Lebesgue measure. 



We say a PIP is of Lennard- J ones type, see iRuelld (Il969[ ) , if its interaction function can 



be written as <p(x,y) = exp(— 6V(||a; — y\\)), and the pair-potential V satisfies the following 
conditions. 

(1) There exist r < R and a g > D such that 

V(x) > for ||x|| < r 

V(x) > -\\x\\- e for ||x|| > R. 

(2) V(x) > —M for a M > and for all x > 0, i.e. the interaction function is bounded 
from above by e bM . 
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The technique used in Subsection 13.31 stands and falls with a good estimate on the term 
su P£e9t ^A k {x I £)■ The next lemma gives a neat replacement of Lemma 13.141 for Lennard- 
Jones type processes, which then yields an analogon of Theorem 13.151 

Lemma 4.1. Assume that H is a PIP of Lennard-Jones type with constants g,r,R,M. 
Choose a positive 5 <r such that also 5 < R/2. Define 

M t = expf^M + ^(^) B %zgl)\ (4,) 



q-D\ 5 J (R-25y- 1 



where 

l) D )l{r < R}. 
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m = m(r, R; 5) = a D D D ' 2 (fj + 1^° - ( 
Then for all x G X and for all £ G 9t we have 

\ Ak (x \ < P(x)M k . 

Proof. Since if < e bM , we obtain analogously as in the proof of Lemma [3.141 and by using 
the translation invariance of V that 

sup X(x | < /3(x) sup exp ( bM£(A(x, r, R)) - b V] v (\\v\ 

5eAfe « eAfc V yeUv\\>R 



bMmk 



< 0{x)e 



supexpf-6 V (\\y\\))- ( 4 - 2 ) 



y^,\\y\\>R 



Let {Q z } z& z D denote the partition of R 13 into cubes of edge length 5/ \/D and centre points 
5/\TD r L D . Since £ G A kl each cube contains at most k points. A cube intersects B(0,i?) c 
if and only if its centre point is contained in B(0,i? — S/2) c . For x = {x\, . . . ,£d) G M. D , 
denote llxlLax = maxj = i ... d\xA. Thus 



sup - v (h\\)< Yl k sup x f~^" x ") 

^ &Ak y^,\\v\\>R zezP,\\z\\>VD(R/s-i/2) U^IU^I VV U ' 



4=) 8 E su p } x \\~ e - ( 4 - 3 ) 

' z& D ,\\z\\>-/D{R/ 8-1/2) \\' X - Z W^^\ 



Consider the function g(x) = (\\x\\ — VD) e . Since 



inf g(x) > [\\z\\ ) > sup \\x\\ e , 



% 1 1 max 5^ 2 

the last sum in (14. 3[) can be bounded by an integral over the function g. For any a > 
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we get 



r poo 

/ (||x|| - v f DT e dx = a D D / (r - Voy^r - 1 dr 

</B(0,a) c Ja 

-_ aoD H ( r _+^R\ r D-i- edr < anD ( —l—V 1 r r D -^dr 

Ja-VD\ r J \a-VDj J a -y/D 

-a D D I - 



\ E> - 1 1 1 
a \ 1 1 



\a-y/DJ e~ ^ (a- VD)s- D ' 

In order to catch all cubes in ( 14. 3p . the integration must begin at a = yD(R/8 — 1), which 
together with (14. 2p yields the claim. □ 



Example 4.2. Consider the classical Lennard- Jones process in three dimensions, i.e. 

V(x) = 



R\ 12 ( R 



\x\\ 

Then one can choose r = R and g — 6, thus 

4.2 The hard core process as limit of area interaction processes 

For simplicity let again (X, a) be a compact subset of M. D with Lebesgue measure. In this 
subsection let M(x, R) always denote the closed ball in M D rather than in X. Let H be a 
Strauss process with parameters R, (3 > and 7o = (hard core case). Let furthermore 
H := Ep^ be an area interaction process with parameters R/2,{3,j, where 7 G (0, 1]. The 
unnormalized density of such a process is given by 

and the conditional intensity is therefore 

V U J = p -\*{x,R/2)\jJ vG i*<3lJll*)\. 

Baddeley and van Lieshout (|l995h for more details. The authors show that j£f (H^ 



sec 
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-Sf(H) weakly as /3, 7 — >■ in such a way that /3^~ Q o(«/2) _j. ^ Q ^ j g eas jiy seen that the 
hard core process referred to by Baddeley and van Lieshout is in fact the Strauss hard core 
process we use). We derive a rate for this convergence. 

Theorem 4.3. Let H and H be as above. Then, 

d TV (JZ>(Z),JZ>(K)) < Cl (A)(|/3 7 - ai5W2)I5 -Po\\X\ +^~ aD{R/2)D E\E\I D (R, 1 ) 



where 

I D (R, 7 ) : = ! T IB(^/2)nB(o,fl/2)| rfa; < 2a D DR D ~ 1 log^)- 1 ^ 

JM(0,R) 
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Proof. For the difference between the conditional intensities we obtain 
where dist(x, A) = inf yg ^||x — y\\ for any A C M D . Therefore 



z/(x|0-A(x|0 



/K,|0-a(.|0|^ 



< / 1^-^,^/2)1 _ dx + / 7 |B(x,H/2)nU !/e5 B( y ,R/2)| ^ 

J i{£:dist(x,0<^} 

The last integral may be bounded further by 



|f | / 7 IB(^/2)nB(0,i?/2)| dx < |£| f 7 a D (R-\M) D /2° d% 

JB{0,R) JM(0,R) 

Jo 

By the substitution y = log(7~ aD ) (^y-) D this is equal to 

| { | a ^io g(7 -)-^|^ D)(f) %^( fl - 2 ( j _^)*) M < 4 ( 

1 POO 

< 2\i\a D DR D - 1 \og{ 1 - aD )- 1,D ^ J e-yyo- x dy < 2|e|« DJ D^" 1 log( 7 



The last inequality holds since the integral is equal to T(l/D) and the functional equality 
of the Gamma function yields (1/D) V(l/D) = T(l + l/D) < 1 for all D > 1. The result 
follows now from Theorem 13.51 □ 



The next proposition shows that in the case /?7 a D{R/Z) D _ ^ Q above rate is sharp. 
Define X^ := {x G ;f : dist(x, > R} and choose i? such that \X ( ~ R °^\ > 0. 
Proposition 4.4. Lei H and H be as above. Assume that fi^~ a o( R / 2 ) D = fi Q an d R < 
Then there exists a positive constant k such that 

d T v(^(S),JS?(H)) > Kl D (R,<y). (4.4) 
Proo/. Define A = {£ G 0T: 3{x,u} C £, \\x - y\\ < R}. Note that P(H E A) = 0. Hence 
drv(Sf(E),Sf(H)) = sup |P(S G B) - P(H G S)| > |P(3 G A) - P(H G A)\ = P(S G A). 

Denote by c~ the normalizing constant of the density of 5. Then 
P(3 G A) > P(S G A |S| = 2) 

= Cs £^ / / /3 2 T -[B(x ll fl/2)UB( a!a ,il/2)[ 1 ^|| a . i _ X2 || < ^1 ^ ^ 2 

= C JL / / ^2 -|B(* x ,fl/2)| -|B(* 2 ,fl/2)| |B(*i,fl/2)nB(aia,fl/2)| ^ ^ 

2 J* Jm(x2,R)r\X 

> CH ^ /3 2|A'(- fi )| / 7 |B(^/2)nB(CW2)| ^ 



M) 



where we used that B denotes a ball in R and the translation invariance of the Lebesgue 
measure. Since 7 < 1, we have w s (£) < c s /3^' for all £ G 9t. Integrating with respect to P01 
yields c s > exp(|#|(l - /3 )). Thus one may choose re = e-^^l^-* )]^. □ 

4.3 Discrete Processes 

Let (X, a) be a general space with a diffuse measure ex. Our aim is to compare Gibbs 
processes which live on a finite subset A = A n = {yi}f =1 of X with Gibbs processes on X. 
Let V = {Vi}™ =1 be a partition of X such that j/j G V$ for all i = 1, . . . , n. A natural 
choice is the Voronoi tesselation, provided that ot(dVj) = for all i = l,...,n. Define 
ry = maxj = i ) ... jJl sup xg y yj), the maximal radius of the cells in V. Furthermore let 
(91a, A/a) denote the space of point measures on A with its natural a-algebra, which coincides 
with the power set of 91 

Define the reference measure cka on A by ex^({yi}) = ex{Vj) for all % — l, . . . ,n and let 
Poa denote the Poisson process distribution on A with intensity measure ct\. The Gibbs 
point processes on A are then defined in the obvious way, i.e. as the point processes that 
have a hereditary density with respect to Poa. 

Let Ha ~ Gibbs(-UA) be a Gibbs process on A. Define a point process Eu on X in the 
following manner. Each point of E\ is replaced by a a^ly/a^) -distributed point in the 
corresponding cell V*. More formally, if H A = XT=i then 

n Ni 
i=l 1=1 

where the Uu are all independent and Uu ~ ol{-)\vJ ociVi). 

Define a function £: X — > A which maps each point in X to its lattice point in A, i.e. 
t(x) — yi if x G Vi for i — l, . . . , n. In the same spirit set £(£) = t(XLe£ ^) = XLe£ ^t(x) f° r 
every ^G2t. 

Lemma 4.5. Let 2a ~ Gibbs(-UA)- TTien the corresponding point process Eu on X has 
density uu(£) = ma (*(£)) ™^ respect to Po\. 

Proof. For ji, . . . ,j n G Z+, set k = YTi=i3i and 

An,.., jn {i < 91: aVi) = ./:• • • ■ ,£(K) = jn}- 

Then, writing 

/ k \ k\ 
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for the multinomial coefficient, we have 



F(~u G A n _ jn ) = P(~ A G A n _ 3 

e -ct{X) ^ f k 



k .)<x(v 1 y\.. a (v n y™ 

\Jl, ■ ■ ■ , 3nJ 

Jx Jx 



k\ 



j{#{r: x r G Vi} = jx, ■ ■ ■ , x r G K} = in} a(dxi) ■ ■ -a.{dx k ) 

= P(HgA jW J, 

(4.6) 

where 5 ~ Gibbs (uao£). 

Note that the density u\(t(-)) is constant on any Aj lt ^ j n . Hence, given that S(Vi) = 
jx, ■ ■ ■ , E(V n ) = j n , the process E may be written as J^ILi S?=i <^«> where the Uu are all 
independent and Uu ~ a(-)|y i /a(Vi). Thus, for every measurable h: 9t — > WL + we get 
E(/i(H[/) | A,- lv .. Jn ) = E(/i(H) | Aj ly __ ; j n ), which together with Equation (14. 6p and the formula 
of total expectation yields the claim. □ 

Many Gibbs processes E on X with density u have a discrete analogon, which is obtained 
by restricting the density to A and renormalizing, i.e. by using the unnormalized density 

= U W A on 9^A) provided that it is CKA-integrable. Some special care is required when 
evaluating u at point configurations £ G 9t that have multi-points. For the continuous Gibbs 
process such £ form a null set, whereas for the discrete analogon the values of u\ at such 
£ become important. We avoid this problem by assuming that = for any £ with 
multi-points, which leads to discrete analoga Sa that may be represented as collections of 
Bernoulli random variables (I y ) yeA . Consequently Ejj may have no more than one point in 
any cell V^. 

By Theorem 13.51 and Lemma 14.51 we immediately obtain the following proposition. 

Proposition 4.6. Suppose that E ~ Gibbs(z/) satisfies (S). Let Ha be its discrete analogon 
and let Ejj be given by (14. 5 j) . Then, 

d TV (Jf(E u ),^f(E)) < cx(X) [ E\u(t(x) | t(Eu)) - v(x \ E v ) \ a(dx). (4.7) 

Jx 

Consider the special case where 5 ~ PIP with a constant (3 and with (p < 1 

(inhib it ory c ase). Our process 5 a is then an auto-logistic process in the terminology of 



Besagj (J1974J). In iBesag et al.l (119821 ) convergence of the corresponding H;y-process density 
towards the S-process density is studied under a continuity condition on <p, without providing 
rates. 

We have 

u(t(x) | t(Eu)) = /? J] <PM x )Av)) a - s - 
Imitating the proof of Theorem 13. 7\ we obtain from Proposition 14.61 the following result. 
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Proposition 4.7. Let H ~ PIP (ft, p) with constant ft and p < 1. T/ien 



drvCJgfCS^.JgfCS)) < Cl (\)E\Su\p sup / ^),%)) -p(x,y)\ a(dx). (4.8) 

ye* 7 x 

Corollary 4.8. Let H ~ PIP(/3, </?) wift constant ft and a p> < 1 £/ia£ zs Lipschitz continuous 
with constant L in both components. Then 

<kv(S?(Eu), JSf (S)) < 2c 1 (A)E|H c/ |/3La(;t')ry. (4.9) 

Proof. Note that by the triangle inequality 

|y>(t(o;),t(2/)) - p(x,y)| < |p(t(ar),t(y)) - ¥>(t(x),2/)| + |y(t(a;),y) - <^(x,y)| < 2£ry. 

□ 

Example 4.9. Let H be a Strauss process, i.e. p(x,y) = 'y 1 ^ d< y x ^^ R ^ for a 7 G [0,1]. Write 
A(y,Ri,R 2 ) = {x E X : Ri < d(x,y) < R 2 }. For x ^ R - 2r v , R + 2ry), we have 

l{d(*(*),t(v))<fl} _ iRa^A} _ q 

and for x G /!(?/, i? — 2r v , R + 2r„) the modulus of the above difference is at most (1 — 7). 
Hence by Proposition 14.71 

d TV (^(Eu), JS?(S)) < c 1 (A)E|H l/ |/3(l - 7) sup a{A{y, R - 2r v , R + 2r v )). (4.10) 

In the Euclidean case we get a linear rate in ry, since R — 2r v , R+2r v )) < Ao>e>D(R+ 

2r v ) D ~ l r v . 

Remark 4.10. By combining the techniques of Corollary 14.81 and Example 14.91 we obtain 
linear rates in ry for any inhibitory interaction function p that is piecewise Lipschitz con- 
tinuous. 

To compute the distance between H and its discrete analogon H A we need another 
distance than the total variation. This is because P(S C A) = P(|S| = 0) and thus 
g?tv (-^(Sa), > 1 — P(|H| = 0), whereas one would like to have a distance that van - 



ishes as ry — > 0. We use the following Wasserstein metric; see iBarbour and Brownl ( 119921 ) 



Section 3, for details. Let £ = Y^=i an d V = Y^Li &vi- Define a metric di on 91 by 



1 if n 7^ m, 

i min CTe5n J]r=i min(rf(xi, 1) if n = m, 



where S n denotes the permutation group of order n. Denote by T 2 the set of functions 
/: — > [0, 1] such that — /(//)| < (£,'?) for all £, 77 G 9t. Our Wasserstein distance is 
then defined by 

d 2 (JSf(3),JSf(H)) = sup |E/(S) -E/(H)|. (4.11) 
We obtain the following theorem. 
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Theorem 4.11. Suppose that E ~ Gibbs(z/) satisfies (S). Let 2a be the discrete analogon. 
Then 

d 2 (-Sf(S A ),-S?(S)) <r y + Cl (A) / E|i/(t(x) |t(S^)) - v(x\Eu) \ ot(dx). (4.12) 

J* 

Proof. We have 

d 2 (jSf (S A ), J&f (E)) < d 2 (JSf (S A ), J2f (S y )) + tZ 2 (J2f (Hy), Jgf (S)). 
For the first summand we obtain by the Lipschitz continuity of / G J-" 2 that 
sup |E/(Ea) — E/(Ej/)| < Ed 1 (E A ,E u ) < r v , 

where we used that the distance between any point in Ha and its replacement point in 
is at most r v . 

Since Ti C Ttv, the Wasserstein distance is always bounded by the total variation dis- 
tance. The second summand above may therefore be bounded according to Proposition I4.6[ 
which yields the claim. □ 

5 Couplings of spatial birth-death processes 

Let b(- 1 •), d(- 1 -) : X x OT — >■ M+ be measurable functions such that := J b(x | £) ot{dx) 
< oo for every £ G OX and set = XLe£ I ^ anc ^ = ^ ^ spatial birth- 

death process (SBDP) (Z(t)) t > with frzrf/i rate 6 and deai/i rate d is a pure-jump Markov 
process on 9T that can be described as follows: given it is in state £ G OT and a(£) > it 
stays there for an Exp(a(£))-distributed time, after which a point is added to £ ("birth") 
with probability 6(£)/a(£) or deleted from £ with probability c?(£)/a(£) ("death"). If a birth 
occurs, the new point is positioned according to the density b(x | £)/&(£)• ^ a death occurs, 
the point x G £ is omitted with probability | £) / ) . In th e case d(£) = the SBDP 
i s abs orbed in £, i.e. stays there indefinitely. Preston ( 19751 ) and Moller and Waagepetersen 



( 120041 ). Chapter 11 and Appendix G, give more formal definitions of general SBDP and a 
wealth of other results, including conditions to assure that the SBDP is non-explosive, i.e. 
that with probability 1 only finitely many jumps can occur in any bounded time interval. 
Denote by (Z^{t)) t >o the process with deterministic starting configuration £, i.e. ^(0) = £. 

We concentrate here on the case, where b(x | £) := X(x | £) and d(x | £) = 1 ("unit per- 
capita death rate") and where Condition (S) holds. Thus we obtain a non-explosive SBDP 
that is time-reversible with respec t to Gibbs(A) and converges in distribution to Gibbs(A) (see 



Moller and Waagepetersenl . 12004 . Propositions G.2-G.4). Time-reversibility with respect to 
Gibbs(A) means that if Z(0) ~ Gibbs(A), we have that (Z(t)) t e[o,T] and (Z(T — t))te[o,T\ have 
the same distribution for any T > 0. Since Condition (S) holds, we may also characterize 
the SBDP with birth rate A and unit per-capita death rate as the unique Markov process 
with infinitesimal generator 

Ah(£) = [ [h(£ + 5 X ) - h(0] A(x 1 a(dx) + [ [h(£ - 5 X ) - h(0] Z(dx) (5.1) 
J x J x 
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for all bounded measurable h: X — > R (see lEthier and Kurtzl . Il986l . Section 4.2 and Sec- 



tion 4.11, Problem 5). The domain $!{A) of A is the set of all measurable functions h for 
which the right hand side above is well-defmied. St(A) contains at least all the functions h 
for which sup^^g^. 

In what follows we construct a coupling (Z^(t), Z v (t)) t >o of two SBDPs with identical 
birth rate A started at individual configurations £, r\ £ OT. We introduce the notation 



X max (x | £,v) = max(A(x | £), X(x \ T))) A max (£,?7)=/ A max (s | £, rj) a(dx) 

J x 

AminO | £,rj) = min(A(x | £), X(x \ rj)) A min (^, r/) = / A min (x | f, 77) a(dx). 

J x 

Define (Z%, Z v ) as a pure-jump Markov process with right-continuous paths, holding 
intervals Di, D 2 , . . ., start time T := 0, and jump times Tj := YH=i D% for all j > 1. Given 
Z^{Tj^.i) = Z v (Tj-x) = rf the distribution of the next jump is described by the following 
random variables, which are assumed to be independent of one another unless specified 
otherwise. Let 

J D i ~Exp(A max (e , ,r / / ) + |OUr / '|), 

Amax(<f , if) 



Gj ~ Bernoulli ( = 

3 vA max (e,r/o + ieuvi 



A max (-|r,V) 



Amax^',??') 

C^~Unif(e'U7/), 

' *~ BemOUm ( A^TO 
B qJ I y,~ Bernoulli ( - A ffi'ff 

where B^j and B Vt j are maximally coupled given Yj, i.e. 

A mi „(F,|r,V) 



= B vd = 1 1 



3 j 



KUYj\e,v') 



If = 1, set Z{(Tj) = Z^iTj-i) + B^Sy^ and Z v (Tj) = Z n (Tj-\) + B^jSy^ If Gj = 0, set 
Ztft) = Z^ x ) - l{Uj E Z.il) ,)} f y. and Z V (T 3 ) = Z V (T^) - 1{U S E Z^T^)} 5 Uy 

Proposition 5.1. Both components Z^, Z v of the coupling are SBDPs with generator 115. 

Proof. Since by Condition (S) the rate et(£, rf) = A max (£, rf) + | ^ L_J 77 1 is bounded by c+ |^Ury|, 
for a constant c > 0, we get 

F(D 1 >t) = l- a(^r])t + 0(t 2 ), 

P(Di < t, £>i + D 2 > t) = a(£, r?)t + 0(t 2 ) and 

F(Dt + D 2 <t) = 0{t 2 ) 
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as t — > 0. Thus for a bounded function h 



Eh(Zz(t)) 

= t( W£, v) + U 77I) (EMC + S €)1( Jy 1 )P(G 1 = 1) + E/i(£ - l{C/i G e}^)P(Gi = 0)) 
+ (1 - t(A max (e, ^) + K U 77|))MO + 0(t 2 ). 

Since 

P(G 1 = 1) = 1 - P(G 1 = 0) Amax( ^' r?) 



Amax(e,»7) + ^U77|' 

we obtain 

lim Efe(^(t))-fe(0 

t->o i 

= A max (£, 77)E(/i(e + ^,1^) - + U tj|E(/i(£ - HU, G Z}6 Vl ) - h(0) 

JA 1 Is u ?7I 



= / [Me + «y)-M0]A(y|0a(dy)+ / [HZ - 5 U ) - h(0] Z(du) 
Jx Jx 

= Ah(0. 



□ 



Define the coupling time as r = t^ v = inf{t > : Z^(t) — Z v (t)}. In order to investigate 
the coupling time, it is convenient to use the stopping times r = 0, r k = inf{t > r k _i : 
Z^(t) — Z v (t) 7^ Z^ijk-i) — Z v (Tk-i)}. These times are the times when something interesting 
happens, i.e. one of the non-common points of Z% and Z T] dies or there is a birth in just 
one of the processes. Let us call the event that a non-common point dies a "good death" 
and the event that only one process has a birth a "bad birth". Assume that there are n 
non-common points in £ and rj, i.e. ||£ — T)\\ = n, where ||-|| denotes the total variation norm 
for signed measures. Define the event A n = {\\Z^(ti) — Z n {T\)\ = n — 1} and the filtration 
T t = {J s<t cr(Z^(s), Z v (s)). An easy calculation gives us the following probabilities. 

P("good death" at time T 3 \ F Tj _,) - W^Tj-i) ~ 



P("bad birth" at time Tj | F T ) = 



Amax(^(7j-l), Zr)(Tj-l)) + \Z^(Tj_i) U Z v (Tj_i)\ 
Amax(^(7j-l), Z^Tj-i)) — A m i n (Z^(Tj_i), Z^Tj-i)) 



Lemma 5.2. The probability of the event A n is bounded from below as 

F(A n )>(l + - sup f \X(x I £') - X(x I r)')\ ct(dx)) >0. (5.2) 

V n \\t'-r/\\=nJX J 
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Proof. We argue in terms of the discrete time Markov chains (Z^(Tj))j^z + and (Z v (Tj))j e % + 
and refer to them as the jump chains of our SBDPs. Define the N-valued random variable 
J by T\{oj) = Tj^j)(u) as the index of the first interesting jump. Then 

F(A n ) = V / P(A„ | J = j, Z^Tj^) = Z v (Tj_i) = rf) 

P( J = j, Z{(Tj-i) G Zri(Tj-\) e <V) 

oo „ 

= £ / P(||^(T,)-^(T,)||=n-l| ||Z c (T 4 )-Z,(T0||=nVi<j-l, 

||^)-^)l|e{n-l,n + l}, 

P( J = j, Z^T^) e d£, Z^Tj^) G drf) 



E 



n 



j^Jwn+ (A max («f, 77') - A min (f , V)) 

P( J = j, Z^(T j _ 1 ) e d£', Z^T^) G drf) 
n 



E 



> 



n + (A max (Z ? (T J _ 1 ), Z v (Tj-i)) - A min (Z 5 (Tj_i), Z v (Tj- X ))) 
n 



(A max (e,^)-A m in(e,^)) 

The claim follows since 



A max (£,??) - Kmn(£,v) = / \K X I - K x I V) \ a(dx), 



X 



which is uniformly bounded in £ and r\ by Condition (S). □ 

Theorem 5.3. For all configurations £,77 the coupling time t^ iV is integrable. In particular 
if £ and r\ differ in only one point, we have for any n* G N U {00}, 

ET£ ,,, ( „._ 1)!( £ r (lfi + ^lff! ds Vi±£^|, ( , 3) 

» i=n* i=n* / i=l 



where 



e = sup / I A (re | £) — A(x | ?7)| ct(dx) and 
\\f-n\\=i Jx 



\\t-*i\\=iJx 

c = c(n*) = sup / |A(x I £) — X(x \ rf)\ a(dx), 

\\£-v\\>n*Jx 

with the interpretations detailed in Proposition \3.1[ The constants e and c are finite by 
Condition (S). 



25 



The following Lemma treats the case n* — 1 and will be useful for the proof of Theo- 
rem [5731 

Lemma 5.4. For all configurations £,77 the coupling time r^ iV is integrable. In particular if 
£ and n differ in only one point, we have 

e c - 1 r e s - 1 

Er^ < + / ds, (5.4) 

c Jo s 

where c = sup^^rH J|A(a:|£) — A(x|n)| et(dx), which is finite by (S). 

Proof. Let p n — (1 + c/n)' 1 , n > 1. Construct a new pure-jump Markov process (Y(t)) t > 
on Z + by the following rule. Given Y (t) is in state n e Z + , after an exponentially distributed 
time with mean 1/n it jumps to n — 1 with probability p ra and to n + 1 with probability 
1 — p n . Define stopping times f n = inf{t > : Y n {t) = 0} for all n > 0, where (Yi(^))t>o 
denotes the process started at n. 

We show that r^ n is stochastically dominated by f n , and therefore Er^ j?? < Ef n . Denote 
by X = \\Z{ — Z v \\ the process counting the non-common points and define a new jump 
process Y' by the following construction. Set Y'(0) = X(0). If X and Y' are on the same 
level, say n, they move together. If X jumps, then Y' jumps with probability p n to n — 1 
and with probability 1 — p n to n + 1. Since P(A n ) > p n , the jumps can be coupled such 
that Y' stays above X. If they are separated, let Y' behave like Y until they meet again. 
Thus we have Y'(t) > X(t) for all t > 0, and hence t^ v is stochastically dominated by 
f' n = inf{t > : Y^it) = 0}, where (Y^(t)) t > denotes the process Y' started at n. 

Note that Y' has the same transition probabilities as Y, but its holding times are some- 
times those of X. Since the processes Z^ and Z v have unit per-capita death rates, each of 
their non-common points dies independently after a standard exponentially distributed time. 
If there are n such points, the minimum of these times is exponentially distributed with mean 
1/n. Hence the holding times of X at n, and therefore also all of the holding times of Y' at 
n, may be coupled with exponentially distributed random variables with mean 1/n that are 
almost surely larger or equal. This yields a coupling of Y' and Y with exactly matched jump 
chains where Y is just a slower version of Y'. Hence f' n is stochastically dominated by f n . 

Define now e n = Ef n , n > 0. Then, by conditioning on the next jump in the Y-chain, we 
obtain 

i-l + - )pn + fe n+ i + -)(!- p n ) 



nJ \ ri/ 

= e n -xp n + e n+ i(l - p n ) + -. (5.5) 

n 

If c = 0, we have e n = e n ~i + 1/n = Y^7=i V z 'j m particular e\ = 1. For c > 0, define 
o n = e n — e n _x, n > 1, assuming that the e n are finite. Then p n = (l + c/n)~ 1 yields 

nil 

a n+ i = — a n for all n > 1. (5.6) 

c c n 

Since eo = 0, the starting point is a\ — e\. The general solution of (15. 61) is given by 



1 + — +C^^, (5.7) 
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for an arbitrar y constant C 6 



A result in iGrimmett and Stirzakerl (120011 ) [Exercise 6, p. 265] states that the sequence of 
expected return times is the smallest non-negative solution of (I5.5P , which yields that the e n 
are in fact finite and given by setting C = 0. We obtain for all n > 1 that 



e n = X ° n - nCLl = nei > 



(5i 



i=i 



and 



e\ = ai 



£(1 



i=0 



l + i 



e c - 1 



ds. 



Note that ei — >■ 1 for c — >■ 0. 



□ 



Proof of Theorem \5.3l Let £ and 7/ be point configurations differing in n points, i.e. they can 
be written as 



i = c + 5 vi and v = c + X 5 y 

i=l i=k+l 

where < k < n, yi, . . . , y n are the non-common points of £ and rj, and £ = £ fl 77. Then 

fc j j'-i 



|A(x|0-A(x|t7)| 



i=fc+i 

By the triangle inequality we obtain 



j=l i=l i=l 

n j j—1 

X KH C+ £ |c+ E *«) 



=fc+i 



i=k+l 



sup / |A(x I £) — A(x I 7^)1 Q:((ix) < n sup / \X(x | £) — X(x \ 77) | ct(dx) = ne. 

\\t-r,\\=nJx ' \\e-i,\\=lJx 

Thus by Lemma [531 we have ¥(A n ) > + for n > 1 and P(A n ) > (l+c/n) for n > n*. 
Assume e > 0, c > and n* e N. Replace the jump-down probabilities p n of Y in the proof 
of Lemma \5 .41 by the above bounds. We then obtain for the differences a n = e n — e n _i of the 
expected return times to zero the recursion equations 



a n+l — ~ a r. 



for 1 < n < n* 

ne — 
1 1 



a n+1 = - c a n - - - - for n > n*. 
The differences for larger n must still be the same. Hence the proof of Lemma 15.41 gives 



(5.9) 



/ j y-j l 

1=0 



ni= (" + k) 



1 + 



c 



n + i 



for all n > n*. The second recursion equation in ( 15. 9 p is best solved backwards, which yields 
the solution 

a n *- k = e k a n , + (1 + e) 



i=i 



rr — i 
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for all 1 < k < n* - 1. Thus 

ei = ai = £ n*-l\-^_^ 1 + M + — V- :, (5.10) 

which can be rewritten as (15. 3p . 

Letting n* — )■ 00 in the inequality Er,c >J? < ei, we obtain Er^ i?? < 3d£ T ^ £ < ^ 
irrespective of c, which justifies setting the long first summand in (I5.3P to zero. Analogously 
as in the proof of Lemma 15.41 we get for e = (which implies c = 0) that e\ = 1, and for 
e > 0, c = that a n * = 1/n*, which justifies the interpretation of the upper bound in (15.31) 
in the limit sense. 

If c does not depend on n*, we choose n* such that (1 + e)^ 1 > (l + c/n) -1 for n < n* 
and (1 + e)^ 1 < (1 + c/n)^ 1 for n > n*. This is obviously the optimal choice and leads to 
n* = \c/e] . □ 

One can also couple SBDPs with random starting configurations. It is convenient to 
use notation of the form Z& also if H is a point process with the obvious meaning that 
(Zs(t))t>a is an SBDP with generator (15. ip and Z(0) = H almost surely. Since this may lead 
to confusing notation when dealing with two processes, we always distinguish the processes 
by adding a prime, thus writing and (Z~(t)) for point processes 5 and S. 

Proposition 5.5. Assume that H and H are Gibbs processes satisfying (S). Consider the 
coupling (Z^(t) , Z~(t)) , t>0. Then the coupling time r s = = inf{t > : ^s(t) = Z~(t)} is 
integrable. 

Proof. The Georgii-Nguyen-Zessin equation and Condition (S) yield 

E|S| = E / 1 E(dx) = E / i/(ar I H) a(dar) < 00, 

where v is the conditional intensity of S, and analogously E|H| < 00. Then (15.81) implies 

Er 3>§ = E(E(r s s | |S|, |H|)) < E(E(e |H|+|g| | |S|, |H|)) < eiE(|S| + |S|) < 00. 

□ 

In particular the coupling time t^h for (non-random) £ G 9T and H is integrable. 



6 Stein's method for Gibbs process approximation 

Stein's method, originally conceived for normal approximation ( Steinl . Il972l ). has evolved 
over the last forty years to become an important tool in many areas of probability theory 
and for a wide range of approximating distributions. See iBarbour and Chenl (120051 ) for an 
overview. 

A milestone in the evolution of Stein's method was the discovery in IBarbour! (119881 ) 
that a natural Stein equation may often be set up by choosing as a right hand side the 
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infinitesimal generator of a Markov process whose stationary distribution is the approximat- 
ing distribution of interest. Many important developments stem from this so-called genera- 
tor app roach to Stein's method, an d several of them conce r n poi nt process appr o ximat ion, 
such as iBarbour and Brown! (119921 ) , iBarbour and Manssonl (120021 ) , ISchuhmacherl ( 120091 ) , or 
Xia and Zhand (120121 ). " 

In this section we develop the generator approach for Gibbs process approximation. Let 
H ~ Gibbs(A) be our approximating Gibbs processes satisfying (S). Define the generator 



Ah(0= [ [h(Z + 6 x )-h(S)]\(x\Z)a(dx)+ [ [h(£ - 5 X ) - h(0] adx) 
J X Jx 



(6.1) 



for all h: 91 — > R in its domain S>{A). In Section Owe noted that A is the generator of 
a spatial birth-death process Z with stationary distribution Gibbs(A), and that its domain 
contains at least all functions h with bounded first differences. 
We set up the Stein equation as 

f(0-Ef(E)=Ah(0- (6.2) 
By analogy to the Poisson process case a natural candidate for the function h is given by 

POO 

h f {0 = - / [Ef(Z ( (t)) - E/(H)] dt. (6.3) 
Jo 

The following lemma shows that hf is really a solution to Equation (16.21) . 

Lemma 6.1. Assume that f is bounded and H satisfies (S). Then hf is well-defined, i.e. 
the integral exists for all £ G 91, and it is a solution to (16.21) . 

Proof. Since =^(H) is the stationary measure of the SBDP Z, we have E/(H) = Kf(Z^(t)) 
for all £ > 0. Thus, using the coupling from Section [5], 



\Ef(Zz(t))-Ef(K)\dt 



E\f(Zt(t))-f(Z^(t))\l{Tt#>t} dt 



POO 

<2\\f\\ OQ ¥(r^ H >t)dt 
Jo 

= 2||/|| 00 E(r €iH ) < oo 



by Proposition 15.51 Hence hf is well-defined. The Markov property of the SBDP implies 
% (Z' z w (t)) = Sf(Z((t + s)). Thus by the substitution v = t + s 

i i / r°° 

(Eh f (Zs(s)) - h f (0) =- - / E[f(Zs(t+s))-f(K)]dt 
s \ Jo 

/oo 
E[/(Z c (t))-/(H)] dt 

E[/(Z f (t;))-/(H)] dv. 



29 



By condition (S) we have F(Z^(v) ^ £) = 0(f) and since / is bounded, Ef(Z^(v)) 
/(£) + 0(v), which implies 



Thus 



Ef(Zt{v))dv = f(t) + 0(s). 



Ahf{£) = lim- [Eh f(Z^(s)) - h f (0) = /(O - E/(H). 



Define the Stein factor as 

Ci(A) = sup sup + <y x ) - 

We are now ready to give proofs for the results in Subsection 13.11 



□ 



(6.4) 



Proof of Proposition ^. 11 Consider the coupling (Z^ + s x (t), Z^(t)) t >o in the sense of Section [5j 
Then by Equation ( I6.3p . 



\hf(t + 6 x )-h f (0\ 



POO 

E / [f(Zs +Sx (t)) - f(Z^t))] l{r e+ ^ > t} dt 
Jo 

POO 

< sup 1/(0-/(^)1/ > t) dt < Etz +s ^ 

\\e-v\\=i Jo 



where we used that < / < 1. The upper bound on ci(A) follows now from Theorem l5.3l □ 
Proof of Theorem \3.5l By the Stein equation (16. 2 j) 

drv(Jgf(S),JSf(H))= sup |E/(S)-E/(H)|= sup |IU/i/(S)|, 

where ^ is the Stein solution given in ( 16. 3p . Then the Georgii-Nguyen-Zessin equation (12. 3p 
yields 



\EAhi 



E / [h f (E + 5 x )-h f (E)]\(x\E)a(dx)+E ! [h f (E - 5 X ) - h f (E)] E(dx) 

J X J X 

E / [h f (E + 8 x )-h f (E)](X(x\E)-u(x\E))a(dx) 
Jx 

< sup \hf(g) - h f (rj)\ / E\u(x \ E) - X(x \ E)\ a{dx) 

<ci(A) / E\v{x\E) - A(x|S)| oc(dx). 
Jx 



(6.5) 
□ 
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Remark 6.2. In the proof above the sup^^ 11^-^11=1 could actually be replaced by an 
essential supremum with respect to JSf (S) + J?f (H). This can be seen as follows. Let N G M 
be a null set with respect to both Jzf(S) and «if(H). Without loss of generality we may set 
the densities of 5 and H to zero on N. By the hereditarity of the densities we have 

[h f {E + 5 x )-h f {E)]{X{x\E)-v{x\E)) 

= [hf(E + 5 X ) - h f (E)] 1{E + 5 X G N c , E G N c }(X(x \ E) - v{x | S)), 

whence it follows that the first inequality of ( 16. 5 p holds also for the essential supremum. 
A consequence of this replacement is that it suffices to take the essential supremum with 
respect to ^(E) + J?f (H) for the computation of the constants e and c in Proposition 13.11 

Proof of Theorem \3. 7[ Let v and A denote the conditional intensities of E and H. Then for 
i = YJLi $vi e ^ we nave 



v ( x 1 - K x 1 = p( x ) ( n y> > ~ n y } 

j=l \ ^i=l / H=j+1 ' M=l ' H=j 

i-i 



= P(x)^l(vifayj)-(p2(x,y j )^(Y[(pi{x,y i )j( JJ p 2 (z,2/. 
Therefore it follows by T heorem 13.51 ^ < 1 for i — 1,2, and Campbell's formula (see 



Daley and Vere-Joned . 12008k Section 9.5) that 



<M^(S),^(H))< Cl (A)EQf jf 



/9(ar)|^i(ar,2/) - <f 2 (x,y)\ a(dx) E(dy) 



c iW / / P(x)u(y)\<p 1 (x,y)-<p 2 (x,y)\a(dx) a(dy). 



x J x 



Regarding the bound on Ci(A) we have for all x, y G X and £ G 91 that A(x | £ + 5 y ) 
X(x I £ > )tp 2 (%,y) and A(x | £) < 0(x), with equality if £ = 0. Thus by Theorem 15.31 



e = 

\\t-v\\ 



sup / \X(x\^) — X(x\r])\ cx(dx) = sup / \X(x \ £ + 5 y ) — X(x \ £)\ et(dx) 
-n\\=iJx yex^emJx 



= sup / X(x\^)\(p 2 (x,y) — 1\ a(dx) — sup / /3(x)(l — <p 2 (x, y)) a(dx). 
y ex,^<nJx yexJx 

Furthermore, \X(x \ £) — X(x\r])\ < 0(x) for all x G X and for all £,77 G 91. Thus c < 



Appendix: the case of a non-diffuse reference measure a 

In the main part we restrict ourselves to a diffuse reference measure ot\ see Subsection 12.31 
This appendix shows that our results remain true for general a (always finite). 



31 



Suppose that ot is not diffuse. Consider then instead of X the extended space X = 
X x [0, 1] and equip it with the pseudometric d((x,u), (y,u)) = d(x,y). In the main part 
the metric d serves the double purpose of inducing the a-algebra on X and allowing us to 
define balls when it comes to more detailed considerations of conditional intensities. For the 
first purpose, which does not require an explicit metric, we just use the product topology 
on X; for the second purpose we use the pseudometric d. Note that the topology induced 
by d is coarser than the product_topology, so there are no measurability problems. Define 
5 = a®Leb|[o,i], and denote by Poi the distribution of the Poisson process with expectation 
measure a. Note that 5 is always a diffuse measure, and that a Poi-process is simple, i.e. 
almost surely free of multi-points. 

Transform a point process S = 5^»=i on % hito a point process 5 on X by randomizing 
its points in the new coordinate. More precisely let H = Yli=i ^PQA); wnere U u U2, ■ ■ ■ are 



i.i.d. Leb I [ 0j i] -distributed random variables that arc independent of 2. Following Kallcnbcrg 
( 119861 ). we refer to H as the uniform randomization of H. Writing 01 for the space of finite 
counting measures on X, we introduce the projection Ttx- Ot — >■ OX, 7r^ (X^jLi = 
X^Li'W Note that the image measure of Poi under 7r^ is Pox^ 1 = Poi. If H is a Gibbs 
process with density u with respect to Poi, then a short calculation shows that 5 is a Gibbs 
process with density 

u§(f) = «(tt*(|)) for any £ G OT 

with respect to Poi. 

Hence the conditional intensity of H is given by 

w/ niA w I /Ax 

A((x, u) I = -~— - = A(x I 7T*(£))- 

Let J"tv be the class of measurable functions /: Ot — > [0,1] for which /(£) = /(r/) 
whenever 7r^(^) = 7r^(ry). Furthermore let A be the generator of an SBDP on OT with birth 
rate A(- 1 •) and unit per-capita death rate. Note that 

ci(A) = sup sup \hf(£+6s)—hf(£)\ = c 1 (\). 

Then for any two point processes S and H on X we can show by slightly adapting the 
proof of Theorem 13.51 that 

(h v (Jf(E),Sf(R))= sup |E/(S)-E/(H)| 
= sup |E^7i/(H)| 

< ci(A) / E|z>((x, u) I 5) — X((x, u) I 5) I 5(d(a;, u)) 

= ci(A) / / E|^(x 1 7r^(H)) — A(x 1 7r^(H))| du a(dx) 
Jx Jo 



ci(A) / E\u(x\E) — X(x\E)\ cx(dx), 
Jx 
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where for the last equality the two expectations are the same by the transformation theorem. 

The upper bound for Ci(A) = &l(A) in Proposition 13.11 can be obtained analogously as 
before by bounding the expected coupling time between two SBDPs with generator A whose 
starting configuration differs in only one point. The expected coupling time will not be larger 
than before, because we may match the additional components u G [0, 1] of any new born 
points perfectly in the two processes. 

For the more gener al statement in Re mark 13.61 to hold, we have to replace Condition (E) 
by Condition (£'); see Kallenberg ( 1986h . Section 13.2. 



For the other results in Section [31 which are essentially corollaries of Theorem 13.5} it is 
easy to verify that we did not use the fact that a is diffuse, in particular not that £, rj e 
are multi-point free, except for notational purposes. 
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